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Abstract 

We investigate the use of network coding for information dissemination over a wireless network. 
Using network coding allows for a simple, distributed and robust algorithm where nodes do not need 



(N 
> 

any information from their neighbors. In this paper, we analyze the time needed to diffuse information 
throughout a network when network coding is implemented at all nodes. We then provide an upper 



bound for the dissemination time for ad-hoc networks with general topology. Moreover, we derive a 
relation between dissemination time and the size of the wireless network. It is shown that for a wireless 
network with N nodes, the dissemination latency is between O(N) and 0(N 2 ), depending on the 
reception probabilities of the nodes. These observations are validated by the simulation results. 



I. Introduction 

The information dissemination problem, at its root, is a classical broadcast problem: sharing 
data residing at one node (source) with all others (destinations) in the network. The significance 
of this problem can be readily gauged by the extensive history of prior work on this topic, which 
has contributed to the design of networked communications with protocol stacks that support 
efficient broadcasting. 

Of late, a more modern version of the one-to-many (broadcast) problem has gained promi- 
nence; this is typically referred to as a data sharing among multiple peer-to-peer (p2p) nodes, 
or the all-to-all problem. This problem arises when each node in a network obtains only a 
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fraction of the total information (e.g. part of a video-on-demand file or a software update) 
desired collectively by all. In a simplified version of the all-to-all data-dissemination problem, 
a source file desired by all, is divided into N mutually exclusive information packets, and each 
packet is stored at exactly one node in the network [1], [2], Every node wishes to acquire the 
remaining N — 1 pieces of the source file; the order in which each node receives the remaining 
information packets is not relevant. 

Traditionally, the data dissemination problem over a decentralized network architecture has 
focussed on the impact of the dissemination algorithm designed to optimize a performance metric 
such as the dissemination latency, i.e., the time required for all nodes to acquire the entire file 
[3]. Our contribution in this paper is to analyze the impact of network coding to this problem. 
One of the most significant benefits of our approach is that nodes do not need extra information 
from other nodes regarding the state of the network [4] to accomplish dissemination, significantly 
reducing communication overhead. 

The prior analytical models used for this problem have largely suffered from unrealistic 
assumptions that are unsuited to a wireless network - notably that of pure fail/success (0 — 1) 
whereby either each transmission is either successfully received by all neighbors or fails. Our 
analysis advances the state of the art by using a more appropriate link model whereby for each 
broadcast, sink nodes successfully receive the transmitted packet with a reception probability that 
depends on the nodes respective locations. Note further that in all broadcast wireless networks, 
the role of the multiple access or MAC protocol is fundamental to managing interference. In 
our work, we assume orthogonal access, such as random TDMA, thereby obviating the need to 
model multi-access interference. Such a set-up is admittedly limited but nonetheless useful, as 
it encompasses practical network scenarios such as a single-cell CSMA/CA [5], [6]. 

Finally, we emphasize that we only focus on the dissemination latency (i.e., the time needed for 
all nodes to communicate and receive the entire file) and do not consider the latency caused by 
encoding/decoding which has been studied separately in the literature [7]. In fact, [8], [9] explores 
design of a sparse network coding matrix that decreases encoding/decoding time significantly. 
Clearly, the net latency of data dissemination is the sum of our result and the encoding/decoding 
time. 

The rest of the paper is organized as follows: Section II summarizes related works in data 
dissemination. Then, Section III describes the system model and basic assumptions used in 
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this paper. The gossip-based data dissemination using network coding in wireless networks is 
introduced in Section IV. In Section V, we derive an upper bound on dissemination latency. 
Performance evaluations are presented in Section VI and the paper concludes with suggestions 
for future work in Section VII. 

Notations: Bold capitals (e.g. A) represent matrices and bold lowercase symbols (e.g. m) 
denote vectors. The i-th entry of a vector m is denoted by m 8 and superscript T denotes matrix 
transpose. The number of i-combinations from a given set of N elements, when order doesn't 
matter, is denoted by C(N, i). The cardinality of a set S is defined as number of elements S has 
and it is denoted by \S\. The equality between two subspaces Si and S 2 is denoted by S± = £2. 

II. Related Works 
A. Data Dissemination in Wireless Networks 

Data dissemination initially among wired clients first emerged as an appealing alternative to the 
traditional client-server model [10]. Due to the exponential growth of wireless networks, there has 
been considerable interest in the sharing problem between wireless nodes (such as cellular phones 
or PDAs) that are interconnected wirelessly. Clearly, data dissemination in wireless networks 
differs from the wired scenario in significant ways - the broadcast nature of wireless contributes 
(potentially) to multi-access interference. As a result, simultaneous reception of multiple packets 
in a wireless network is typically limited. Furthermore, wireless links are half-duplex due to 
hardware constraints; i.e., a node may not simultaneously transmit and receive due to the lack 
of sufficient isolation between the two paths. 

Despite these differences, gossip-based protocols - originally applied for dissemination over 
wired networks - have remained as the algorithmic family of choice in wireless networks for 
dissemination, as they inherit the desired balance between simplicity and robustness [11]. Some 
modifications have been applied to make these protocols consistent with wireless properties. For 
instance, when a node transmits a packet in a wireless network, every node within its transmission 
range receives the packet [12], [13] in contrast to to gossip algorithms in wired networks that are 
based on unicast communication with a randomly chosen neighbor. However, the main feature 
of gossip-based algorithms is preserved - every node in the network acts simply based on its 
local state information. 
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B. Network Coding for Dissemination 

A relatively new approach to decentralized data dissemination is based on network coding, 
first introduced for multicasting in wired networks [14], [15], [16], [17], [18]. Authors in [19] 
exploit random (linear) network coding within a gossip-based dissemination protocol for wired 
networks and show that, in a fully connected network, network coding can improve dissemination 
latency at the cost of a small overhead associated with each packet. In fact, they show that, in a 
network with N nodes, data can be diffused in O(N) time using a coding-based gossip algorithm 
rather than O(NlogN) without network coding. Mosk-Aoyama and Shah extend the result in 
[19] to a wired network with arbitrary topology [20] and show how the graph topology affects 
the performance of the coding-based dissemination algorithm. 

For wireless networks, network coding was first exploited for wireless broadcasting - treated 
as a one-to-many data-dissemination problem - using simple XORs [21] that demonstrated 
bandwidth efficiency over traditional wireless broadcast approaches. Recently, [22] used network 
coding within a gossip-based algorithm (rumor- spreading) to diffuse information in an ad-hoc 
wireless network. This was continued by [23] who studied performance of such dissemination 
due to real- world MAC schemes. 

There is a plausibility argument as to why network coding provides substantial benefits for data 
dissemination. As mentioned before, gossip algorithms impose restrictions on the information 
possessed by a node; i.e., every node has only a local view of the system state at any time. 
This is analogous to the initial state in our scenario, where each node has only a (small) unique 
part of the full file, and seeks to gather the other pieces. With time, a node gathers some of 
the other pieces but does not have any information about what pieces the neighboring nodes 
may possess. It is natural that, at any time, different nodes in the network will have acquired 
(at least some) non- overlapping pieces. Intuitively, this suggests that if each node encodes all 
the data it presently contains (via network coding) and broadcasts it, recipient nodes will have 
acquired coded versions about the missing pieces. In time, each node will be able to decode the 
full file, after a sufficient number of such encoded packet transmissions from other nodes, as 
subsequently described. 
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III. System Model 

As usual, a network graph is denoted as G(V, E), with \V\ = N nodes and links E C V xV. 
We assume that the network is slotted (i.e., all nodes are synchronized) for simplicity and that 
all transmissions occur synchronously with a common clock. Further, without loss of generality, 
we assume that during each time slot, a node v E V can broadcast exactly one packet. When 
node v broadcasts, node u E V receives the signal correctly with probability P vu . We consider 
an interference-free (orthogonal) access that allows only one node to transmit at a time. This 
includes, among others, a single-cell 802. 11 -type infrastructure network based on CSMA/CA if 
all nodes lie within the (common) carrier sensing range. The probability of a node capturing 
the common channel at any time is assumed to be uniform among all nodes. Finally, we only 
consider networks with fixed topology, i.e., where the topology and link capacities do not change 
over time. 

IV. Data Dissemination using Network Coding: Arbitrary Network Topology 

Assume that each node u initially has a single information packet x u to be shared with 
every other node in the network. Hence, the set of unique (information) packets in the network, 
initially and at all subsequent times, is given by {xi, . . . , x^} for a network with N nodes. Each 
information packet is a vector of r symbols, where each symbol is an element of a finite field 
F 2 g, i.e., x u E ¥ r 2q for each node u E V . For convenience, assume that q divides the length 
of packets transmitted (otherwise, zero padding is applied). Moreover, all packets are linearly 
independent vectors in F 29 , reflecting the fact that nodes have different information to share 1 . 
The results and derivations presented in this paper can be extended to a case when some nodes 
have more than one message and some have none or when all the messages are there with one 
particular node to start with. 

With time, each node receives a sequence of linear combinations of information packets at 
the other nodes. Hence, after a sequence of broadcasts, node u E V possesses a set of coded 
messages, S u (t) at time (slot) t. 

S v (t) = {mi,m2,...,m\ Su (t)\}, (1) 

'For more information about linear independence in finite fields, refer to [24], [25]. 
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Each message rrii is a linear combination of the underlying information packets 2 , initially 
possessed by the nodes and can be represented as 

N 

m i = y^ j a it kXk = afx, i = 1, 2, ..., \S u (t)\, (2) 

fe=i 
where some of the coefficients a iy k may be zero (if the corresponding information packet is not 

present at the transmitting node at that time). For each message rrii, a^s are called its network 

coding coefficients, and they are available through the header of the packet containing m,. As 

discussed in [26], in a network coding system, each packet consists of two parts: a header that 

contains the network coding coefficients and a body that carries the encoded message. This 

header is a price to pay to use the network coding. However, if the size of the information 

packets (and hence the size of the messages) is reasonably large, this overhead is negligible. 

That being said, for each message m, at node u, network coding coefficients are available. 

Clearly, S u (t) (set of messages at node u at time t) spans a subspace in ¥ r 2q , as observed by 

rewriting Eq. (2) in the following form: 

m u (i) = A tt (t) x, (3) 

where A u is the coefficient matrix consisting of NC coefficients and x is the iV-vector of all 
the information packets in the network, given by 



(4) 



x = [ 


Xi x 2 ... 


xn\ , 




m u (t) = [mi m 2 . . 


• m \s u (t)\] T , 
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A u (t) = 










a \S u (t)\,i 


a \Su(t)\,2 ■ ■ ■ 


a \S u (t)\,N 



is defined by a^j, the j-th coefficient used in the i-th message m« received by node u. 

From the received broadcast messages, a node can decode all the information packets at time 
t if the rank of its coefficient matrix, A u (t), equals N, the total number of unique information 
packets in the network. In the next section, we use this observation to put an upper bound on 
dissemination latency in the network. 

2 Recall that each node u starts with a single information packet x u to share with the rest of the network. 
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If node u captures the channel at any time slot t, it transmits a linear combination of all the 
messages that it currently contains, as follows: 

\Su(t)\ 

Vuit) = ^ Pi m ii ( 5 ) 

i=l 

where y u (t) denotes a transmitted packet at time t, S u (t) is a set of messages node u contains 
at time t, and /%, i — 1,2, ..., |,S U (£)|, are random finite numbers selected from F 2 «. Using (3), 
the transmitted message by node u at time £ can be written in terms of information packets as 
follows: 

y u (t) = (3m u (t-l) (6) 



(3m u (t - 


-1) 


PA u (t - 


-l)x 


ex'x, 





where a' = /3A M (t — 1) presents network coding coefficients of information packets used by 
node u to transmit a message at time t. As mentioned before, these coefficient are send a long 
with y u (t). 

V. Stopping Time 

The data dissemination algorithm terminates when all nodes are able to decode the broadcast 
messages to recover the underlying N set of information packets, which happens when Eq. (3) 
for all u E V has a unique solution, i.e., when the coefficient matrix at each node has full 
rank N. Hence, the stopping time T is defined as the first time when the full rank condition is 
achieved: 

T = min{ran£;(A M (£)) = TV Mu e V}. (7) 

Because matrix A u has rank TV if and only if S u (t), the subspace scanned by messages in u at 
time t, has dimension N . The stopping time can also be defined as follows: 

T = min{dim(S u (t)) = JVVmG V}. (8) 

Clearly, T is an integer random variable over [N, oo) 3 . We next seek the expected value E[T] 
as a performance metric for algorithm design. In general, E[T] is difficult to compute; hence, 
we resort to bounds. 

3 To diffuse N packets, we need at least TV transmissions, which in wireless networks require at least N time slots. 
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A. upper bound for Mean Stopping Time 

As mentioned above, S u (t) is the set of packets that node u possesses at time t. Each packet 
is an r vector in a finite field F 2 <? (i.e., it belongs to F^). Moreover, we assume that packets in 
S u (t) are independent. Hence, the cardinality of S u (t) is the same as its dimension. 

By our formulation, every node initially starts with an information packet. In other words, at 
t — 0, there is one and only one (independent) packet in S u (0), i.e., dim(S u (0)) = 1 Vu G V. 
With time, the information spreads to all nodes upon sharing via broadcast, resulting in a final 
per node dimension of N at the time of stopping. Hence, each node dimension is raised by 
N — 1 during the information dissemination, and the overall dimension increase among all the 
nodes is N(N — 1). Let us define D(t) as the total dimension increase (among all the nodes) at 
time t. Obviously, D(t) can be written as follows: 

D(t) = J2 dim(S u (t)) - N. (9) 

uev 

Clearly, the information has spread to all nodes when D[t) = N(N — 1). Now, let % denote 
the number of time slots until the total dimension increases by i(N — 1). It can be written as 
follows: 

Ti = min{D(t)>i(N-l)}. (10) 

By definition % = and the information spreads to all the nodes at T/v, i.e., T = Tn- 

The following lemma gives an upper bound for the probability of the message sets at two 
nodes spanning the same subspace when t — %. 

Lemma l.Att — %, the probability that any two nodes (e.g., u, v) have the same subspace can 
be bounded from above as follows 4 : 

P(S U (%) = S v {Ti)) < Y, mm I ' l - J P{dim{S v {t)) = k + 1). (11) 

fc=i ^ ' 

Proof: See appendix. 

4 Note that in Eq. 25 (and in similar cases henceforth), S u (t) = S v (t) indicates the equality between two subspaces, the one 
spanned by elements in S u and the one spanned by elements in S v . 
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Intuitively, the probability that two nodes have the same subspace at % is calculated using 
the law of total probability: probability that one of them has dimension k multiplied by the 
probability that the other one has the same subspace. The latter can be calculated by recalling 
that % represents the number of time slots at which the total dimension increase is i(N — 1). 
We next explore the dimension of node u E V at time %. The following lemma gives an upper 
bound on the probability of node u having dimension k at %. 

Lemma 2. The probability that node u has dimension k at time % is given by: 

P(dim(S u (7l)) = k)= (12) 

E J= o(-^yC(N - l,j)C(i(N -l) + N-2-jN-k,N-2) 
E,=o(-l) J CW j)C(i(N -i) + N -l-jN,N-l) ' 

Proof: See appendix. 

By definition, at time %, the dimension has increased by i(N — 1) and this comes from TV 
nodes. Now the probability that one node has dimension increase k is in fact the traditional ball 
and bins problem. 

The following theorem provides an upper bound on expected number of time slots between 
71 and 77+i- 

Theorem 1. For < i < N - 2: 

N - 1 

2Jv(2^«,«ev- P ™)( 1 _ P(S u (T i+ i) = S v (Ti+i))) 
and for i = N — 1 

E[T N - Tn-i] < Jf^'^p v d4) 

2N\2-/u,veV * uv ! 

Proof: See appendix. 

Finally, the following theorem, which is the main contribution of this paper, gives an upper 
bound on the stopping time T. 

Theorem 2. Let T be stopping time. Then 

E[71 = E[T N ] (15) 

< N ~ x (y l 

2JV Lu,»eF Puv \ i=1 1 — P{S u (7i) = S v (Ti)) 
March 1, 2013 DRAFT 
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Proof: See appendix. 

Note that to numerically calculate the upper bound given in Eq. 15, we need to combine 
the results in Eqs. 25 and 12. Since each node is initialized with a single packet, it needs to 
acquire the remaining N — 1 packets from other nodes for the process to terminate. Hence a 
total of N(N — 1) successful packet transmissions must occur. Due to the broadcast nature of 
wireless, multiple receive nodes hear each transmission and may decode the transmitted packet 
(according to their reception probability; higher reception probability results in a higher chance 
of decoding). Therefore, the number of time slots required is inversely proportional to reception 
probability as captured by the first part of the upper bound. The second part of the upper bound 
represents the fact that a successfully received packet v at a node is only useful if it does not 
belong to the subspace spanned by existing packets; i.e., it is 'innovative'. Again intuitively, the 
probability of a packet being innovative at a node decreases with time, as the subspace spanned 
by existing packets is always monotonic non-decreasing. 

In an interference-free network where only one node transmits in each time slot, Theorem 2 
gives an upper bound on dissemination latency when NC is used. As expected, the upper bound 
depends on reception probabilities and the number of nodes inside the network. In the following 
two lemmas, we consider two extreme cases, a fully connected wireless network and a sparsely 
connected network. These two cases illustrate how reception probability affects dissemination 
based on NC in a wireless network. We show that no matter what the reception probabilities are, 
the average stopping time is between O(N) and 0(N 2 ) as long as the network stays connected. 

A wireless network G(V, E) is considered fully connected when for every two nodes u,v E V, 
P uv > 0. In other words, in a fully connected network, every node is within the transmission 
range of all the others. 

Lemma 3. In a fully connected wireless network in which every node can receive a packet from 
any other node in the network with a nonzero probability, the average stopping time is of order 
O(N) when network coding is applied. 

proof: Appendix. 

Above, the corollary is consistent with the result of the data dissemination in a wired network 
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when network coding is adapted. The authors in [19] show that the stopping time increases 
linearly with the size of the wired network (i.e., it is of order O(N)) if the network is fully 
connected; i.e., each node is connected to every other node. 

A wireless network G(V, E) is sparsely connected when the network is connected and every 
node is in transmission range of at most two other nodes. An example of a sparsely connected 
wireless network is a linear network when each node can only communicate with its close 
neighbors. 

Lemma 4. In a sparsely connected wireless network, the average stopping time is of order 
0(N 2 ) when network coding is applied. 

proof: Appendix. 

Clearly, Lemma 3 gives the best achievable time 5 (smallest number of time slots), and Lemma 
4 gives the worst time (largest number of time slots needed). In other words, the above two 
lemmas show that for data dissemination in a wireless network with N nodes, the average 
stopping time is between O(N) and 0(N 2 ), independent of the underlying reception probability 
of the nodes. 

VI. Performance Evaluation 

In this section, we present results from a system simulation conducted using MATLAB R2008b 
that conforms to the data dissemination model described. The objective is to explore tightness of 
our stopping time upper bound, given in Eq. (15), for data dissemination in a wireless network 
with general topology and interference-free random access scheduler (i.e., only one node captures 
the channel and transmits, while all others listen). At the physical layer, we use QAM modulation 
with no channel coding. The results reported are based on simulations conducted for two simple 
and useful topologies: regular linear and 2-D grid. Such structured topologies help with better 
understanding of the model behavior as the number of nodes increase. 

A. Reception Probability 

We assume that all nodes use the same transmission power V and modulation scheme to 
broadcast packets. The wireless channel undergo Rayleigh envelope fading, and the path loss 

5 especially when p=l 
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(b) 

Fig. 1. Example of a network with (a) linear topology (b) grid topology 

exponent is 77. Assume node u sends a packet to node v which is d(u, v) far away. Let s U:V be the 
received power at v in the clear channel (i.e., no interference). Then s UjV follows an exponential 
random variable with mean V ■ d(u, v)~ a given by the following probability distribution function 

(pdf): 

1 s 
f( s u,v) = ^77 Tzzexp(--— — -). (16) 

Node v, the receiver, can decode successfully the packet transmitted by u if its received 
Signal-to-Noise (SINR) ratio exceeds a threshold 6 : 

P u ,v = Pr( S -^>z), (17) 

yvo 

The Ao is the variance of additive white Gaussian noise, assumed to be 4 x 10~ 14 at all the 
receivers 7 , and z is the capture reception threshold whose value depends on channel coding and 
modulation. 

B. Linear Grid 

Here, nodes are located in a line, with equal distance between neighbors. At first, we let the 
transmit power remain fixed and increase the size of the network by adding more nodes. Figure 

6 As mentioned before, we are assuming an interference free communication 

7 Noise Power is calculated for the bandwidth of 10MHz and in temperature 300K. This value, however, does not affect the 
result of the simulation. 
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Fig. 2. Both the analytical upper bound and the simulation result are increasing linearly at first, and after N — 30 they start 
increasing non-linearly versus the number of nodes in the network, d — 30, V = 20 x 10 -6 , A/o = 4 x 10 -14 . 



2 presents the simulation result and the analytical upper bound for the linear network. As one 
can see, the upper bound given in Eq. 15 closely follows the trend of the simulation results. 

When there are only a few nodes in the network, stopping time has a linear relation with 
the number of nodes in the network. However, when the size of the network keeps increasing, 
the linear relation is not valid anymore. This is consistent with our findings in Lemmas 3 and 
4. For a small number of nodes in the network, nodes are in transmission range of each other; 
i.e., a transmitted packet is heard by all of the nodes in the network (with nonzero probability 
P U v > 0); hence, the stopping time is O(N). On the other hand, when the size of the network 
keeps expanding, after a while we have P uv = for some nodes in the network and that affects 
the trend of the dissemination delay. In Figure 2, after N = 30, the stopping time (from both 
the simulation and analytical result) starts to increase nonlinearly. In fact, one can see that the 
stopping time is 0(N 2 ) after N = 30. 

Finally, we change transmission power to see its effect on dissemination latency and the 
accuracy of the upper bound in Eq. (15). The result is presented in Figure 3. Clearly, decreasing 
transmission power reduces nodes' coverage and results in increased stopping time. However, 
the relation between the stopping time and the transmission power is very interesting and is sort 
of hidden in Eq. (15). 

For a fixed network, we start with OdB power and decrease it to — AOdB. At first, nodes are in 
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Fig. 3. Analytical upper bound and simulation results versus nodes' transmission power for linear network, A/o = 4 x 10 -14 . 

transmission range of each other. In that case, the relation between the transmission power and 
the stopping time is linear. However, after a point, nodes start falling out of the coverage range 
of each other. When that happens, stopping time starts increasing nonlinearly with transmission 
power. Although our formula in Eq. 5 does not show this relation directly (counter to the result 
with size of the network), it has the same trend as the simulation results. 



C. 2-D Grid 

In a 2-D grid topology, nodes are located on a equispaced 2-D lattice as depicted in Figure 
1-b. As for the linear network, we first let the transmission power remain fixed while increasing 
the size of the network by adding more nodes. Figure 4 presents the simulation result and the 
analytical upper bound for the grid network with five different sizes. 

Inanmxn grid network with equispaced d, the distance between every two nodes is less 



than or equal to d\/m 2 + n 2 , which happens to be smaller than the transmission range of all the 
nodes in our simulation. In other words, for the fixed transmission power, each node can hear 
from all other nodes with nonzero probability. It is for this reason that the stopping time has a 
linear trend with the size of the network (Theorem 3). 

Finally, for different transmission power, analytical upper bounds and simulation results are 
presented in Figure 5. As we argued for the network with linear topology, although the relation 
between the transmission power and the dissemination latency can not be directly inferred from 
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Fig. 4. Analytical and simulation results when size of network is changing and nodes' transmission power is fixed for grid 
network, d = 30, V = 20 x 10~ 6 , Ao =4x 10~ 14 . Since nodes are in coverage range of each other dissemination latency 
increases linearly as we show in Theorem 3. 
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Fig. 5. Analytical upper bound follows the simulation results trend when nodes' transmission is changing power for grid 
network, A"o = 4 x 10~ 14 . 



the proposed upper bound in Eq. (15), it follows the simulation results by the same trend. 

D. Advantage of Network Coding 

In this subsection we compared the dissemination latency with network coding using computer 
simulation to a baseline random non-NC algorithm. The non-NC approach - termed random 
selection - operates as follows: whenever a node captures the channel it randomly selects an 
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TABLE I 

Stopping Time for dissemination algorithm in a linear network with and without network coding. 



d = 30, V = 20 x 10 



-6 



# nodes 


23 


27 


30 


35 


NC-based 


84.46 


100.94 


121.54 


157.59 


random-selection 


1189.6 


2180.8 


3148.9 


3713.4 



TABLE II 

Stopping Time for dissemination algorithm in grip network with and without network coding, we have 

d = 30, V = 20 x 10~ 6 



# nodes 


4x4 


4x5 


5x5 


6x4 


6x6 


NC-based 


59.34 


78.88 


90.4 


110.5 


130.72 


random-selection 


684.3 


1044.4 


2275.8 


2836.7 


3724.4 



information message from its buffer and broadcasts the selected packet. Table I compares the 
mean time needed to diffuse data in a linear network with network coding vs. the random 
selection algorithm. The performance of the random selection algorithm and NC-based algorithm 
for a 2-D grid topology is compared in Table II. As expected, network coding based dissemination 
outperforms the random selection algorithm by an order of magnitude. The reception probabilities 
are calculated as explain in Section V. 



VII. Conclusion 

In an all-to-all communication within a wireless network, we study the use of using network 
coding was studied in this paper. In a wireless network with general topology, we provide an 
analytical upper bound for the amount of time needed to spread information through the whole 
network. Our result show that by using network coding the stopping time is between O(N) and 
0(N 2 ) where N is number of nodes inside the network. Our simulation results indicates that 
our proposed upper bound follows the trend of the stopping time in networks with linear and 
grid topologies. 

Our result here is limited to the following assumption, only one (random) node is transmitting 
in each time slot. In other words, we are assuming an interference-free scheduler. This assumption 
is valid for small networks in which nodes are in carrier sensing coverage of each other. 
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CSMA/CA-type MAC in such networks are pretty close to interference free thanks to collision 
avoidance protocol. However, this assumption no longer is valid in large networks. In fact, the 
ability of (multiple) simultaneous transmissions (called spatial reuse) is one of the advantages 
of wireless networks which is ignored in our result. Our future work focuses on extending our 
result here to networks with spatial reuse. 
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Appendix A 
Proof of Theorems 

Lemma 1: At t — %, the probability that any two nodes (e.g. u,v) have the same subspace 
can be bounded from above as follows: 

N ~ l ' N - ' 

P(S U (%) = S v (%)) < V minC- , - )P(dim(S v (t)) =k + l). 

z — ' k N — k — 1 

fc=i 

Proof: For the sake of simplicity, we will write S u (Ti) and S v (7i) as S u and S v respectively. 
By using law of total probability we have: 

iV-l 

P(S U = S V ) = J2 p ( S ^ = S v \dim(S v ) = k + l)P(dim(S v ) = k + 1). (18) 

fc=i 

The reason that in the summation k starts from one instead of zero is that P(S U = S v \dim(S v ) = 
1) = 0. That is because we assume nodes initially contain different information packets. To 
calculate P(S U = S v \dim(S v ) — k + 1), let define E v as the set of all nodes (including v itself) 
which spans the same space as S v : 

E v = {u\S v = S u }. (19) 
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Since we know dim(S v ) — k + 1, each member of E v has dimension k + 1, which means we 
have total dimension increase of at least |-E„|/c. At time t — %, dimension of whole system is 
at most by i(N — 1) by definition of % in Eq. (10). Therefore the following inequality holds: 

\E v \k<i(N -1), (20) 

which implies 

\E v \<i(N-l)/k. (21) 

On the other hand, at time t = % nodes need to capture N(N-l)-i(N-l) = (N -i)(N -1) 

dimension to stop. Node is set E v , each need N — k — 1 dimension increase. So the total amount 

of dimension increase nodes in E v require is \E V \(N — k — 1) which cannot be greater than total 

amount of needed dimention at time %. Hence the following inequality holds: 

(JY-i)(JY-l) 
lKl ~ (N-k-1) (22) 

Since \E V \ satisfies both inequalities in Eq. (21) and Eq. (21), it satisfies the following: 

|*|<(W-l)nrfn( 1 ^=i_,i). (23) 

Now we can calculate P(S U = S v \dim(S v ) — k + 1). We know that \E V \ nodes have the same 
dimension as node v. Hence a randomly selected node u (from N — 1 posibilities) has the same 

rp I 

dimension of node v with probability j^zj. Therefore, 

P(S U = S v \dim(S v ) = k + l)<min I - _ fc _ 1 , ^ J • (24) 

By substituting these results in Eq. (25), we have the following inequality: 

N ~ 1 / N-i i\ 
P(S U (%) = S V {17)) < Y, min ( N _ k _r k ) P ^ dim ^ = k + 1 )- C 25 ) 

fe=i ^ ' 



Lemma 2: Probability that node u has dimension k at time % is given by: 

P(dim(S u (%)) = k) = 

Y. 3 =o(-^) 3 C{N - l,j)C(i(N -l) + N-2-jN-k,N-2) 
E J -=o(- 1 ) J ' C '(^ JMW -l) + N-l-jN,N-l) 
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Proof: The probability of node u having dimension k + 1 at % is equivalent to the probability 
of a dimension increase of k for node u at %. At time %, by definition, the overall dimension 
increase among all nodes is i(N — 1). So, the problem is from the i(N — 1) increased dimension, 
which is the probability that fc goes to node w given that the dimension increase of each node 
cannot be greater than N — 1. This is a traditional ball and bins problem. 

All possible outcomes of the ball and bins problem must have the same probability of 
occurrence. Here, all possible outcomes have the same probability because of the following 
assumptions: 

• Nodes are equally lucky in capturing the channels and transmitting in each time slot. 
P — P 

• In the initial state (beginning of dissemination), every node has one and only one information 
packet. 



Theorem 1: For < % < N — 2: 

N-l 



E[7~ +1 - 7~] < — 



(E u , v ev p uv)(l ~ P(Su(T w ) = S v (T i+1 ))) 



2N 



and for i = N — 1 



E [r N -r K _,]< ^•- 1 ) 



Proof: Suppose we are in time slot t where t E [71, 7*+i). By definition, D(t) presents a 
dimension increase at time t. At time slot t, node u would capture the channel and start trans- 
mitting with probability 1/JV. By our assumption, each node transmits one unit of information 
at each time slot; thus, when node u transmits, the dimension of node v would increase by 1 if 
it receives the packet transmitted by u (with probability P uv ) and if the packet contains at least 
one dimension that is not in v. This can be written as follows: 

E[D(t+l)—D(t)] = /J — /^ j P U yP (packet contains at least one dimension which is not in v). 

(26) 
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The second probability in above equation can be calculated using the following two events: 

1) Set S u should not fall into set S v or mathematically speaking S u (t) <£ S v (t). 

2) Given above, the NC coefficients must be selected in such a way to cover some dimensions 
from S u (t) which are not in S v (t). 

The second event above is calculated in [19] to be: 

1 - (i)l(^W n ^W)l, given S u (t) £ S v (t), (27) 

Q 

where S' v (t) is complement of S v (t) and the finite field is F 29 . Hence given u is transmitting a 
packet, we have: 

P (packet contains at least one dimension which is not in v) (28) 

= P(5 u (t)^5„(t))(l-(-) l(S » ( ' )nfliW)l ). 
Thus the average dimension increasing can be calculated as follows: 

E[D(t + 1) - D(t)] = J2 Jf E P uvP{Su{t) £ S v (t))(l - (1)I^(*)^W)I), (29) 

We assume - tv which is practical assumption. For example if we assign only 8 bits to NC, 
~ = 256 = -004. In that case we can rewrite above equation as: 

E[D(t + 1) - D(t)} = J2^J2 p ™ p ( s «(t) ^ S v (t)). (30) 

In above summation, for any two nodes u,v E V, we have a summation term P(S u (t) <£. 
S v (t)) + P(S v (t) £ S u (t)). Clearly, {S u {t) £ S v (t)} C {S u (t) £ S v (t)} U {S v (t) £ S t (t)} 
which results in the following inequalities: 

P(S u (t) £ S v (t)) + P(S v (t) £ S u (t)) > P(S v (t) £ S u (t)) = 1 - P(S v (t) = S u (t))\/u, veV. 

(3D 
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Thus Eq. (30) can be rewritten as follows: 

E[D(t + l)-D(t)] = J2jiJ2 P ™ P ( S ^£ S vW {32) 

u v,v>u 

\ u v,v>u 

+EE Pvu(i - p(s v (t) = s u (t))) 

V u,u>v 
u,v 

where equality (1) is drived by renaming the sum indexes and the last equality is the result of 
having P uv = P vu and P(S v (t) = S u (t)) = P(S u (t) = S v (t)). 

We know that by definition .0(71) = i(iV - 1) so we can add up above statement for t E 
[7i, 7i+i): 

E W + l)-D(i)l4(E P ») E 0--P(S u (t) = S v (t)))- (33) 

te[Ti,Ti+i) u,vev te[Ti,T i+ i) 

In above derivation we also used this fact that P(S u (t) = S v (t)) does not depend on u and v 
as given by Lemma 1 and Lemma 2. 
By definition of % and D(t) we have: 

E E[D(t + 1) - D(t)] = N - 1. (34) 

te[Ti,T i+ i) 
Now we consider two possibilities for i. 

l)ifO<?<iV — 2 then Obviously we have: 

P(S u (T l+1 ) = S v (T i+1 )) > P(S u (t) = S v (t))Vt e [%, T l+1 ). (35) 

By above inequality and equalities Eq. (33) and (34) we have: 

7V " 1 = ^(E P ™) E (1 - P(S u (t) = S v (t))) (36) 

u,v€V te[Ti,Ti+i) 



* ^(E p ™) E (1 - P(Su(T t+1 ) = S v (T l+1 ))) 

u,v&v te[Ti,T i+ i) 

^-( E Puv)(l - P(S u (T l+1 ) = S v (T i+1 )))(m+i - U). 



2N' 

u,v£V 
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and by rearranging the terms we get: 

N- 1 

2n(52u,v&V P uv)(l - P{S u {Ti+l) = 5^(71+1))) 

2) if i = N — 1, then by definition, for all u, v E V S u (Tn) = S v (Tn) which doesn't help us 
to obtain a bound on E[7i+i — T] = E[Tn — Tn-i]- For that let consider a time slot before Tn', 
i.e., £ = Tn — 1. In that time slot the following two statements are valid 

• At least one of the nodes does not have the whole space; i.e. 3u E Vs.t.\S u (t)\ < N 

• At least one of the nodes does have the whole space; i.e. 3v E Vs.t.\S v (t)\ = N 
From the above statements we can conclude: 

P(S u (t) = S v (t)) < ^^ for t E [T N -!, Tn). (38) 

By substituting above equation in (30) and following the same argument as we did for the 
first case we will have: 



E[T N - T N -!] < x^ % ■ (39) 



Theorem 2: Let T be stopping time. Then 

E[71 = E[T N ] 



N-l 



N-l 

2A? l^u,v£V ^ uv V i=1 1 — P\pu\Ti) = o v {Tij) 



£ ■ ■ -- '- 'S t - - - ^ 



Proof: 

E[T] = E[T N ] (40) 



A? 



i=l 
N-l 

^(E[7- - Ti-i}) + E[T N - T N -!] 



i=i 



N-l f™ 1 



" m Eu, vev Puv { ^ (1 - P(Su(ld = S V (W) + N 
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Corollary 3: In a fully connected wireless network in which every node can receive packet 
from any other node in the network with a non-zero probability, the average stoping time is of 
order 0(N) when network coding is applied. 

Proof: First let prove the following lemma: 

Lemma 5. There exist a finite number /3 such that the following is held: 

N-l l 

UJ (i - p(s u (%) = s v (%))) ~ N(3 (41) 



Proof: By definition of the stopping time T, P(S u (t) = S v (t)) < 1 for t < T. Let a be the 
largest number less than 1 such that P(S u (t) = S v (t)) < a < 1. Then 

^ 1 1 ^ N-l 

n 

We are assuming that nodes can receive packets from each other with probability at leat p. In 
other words, each node is in transmission rage of all other nodes and p is the smallest reception 
probability between every two nodes. Note that for p — 1, the wireless network acts like a wired 
network with a complete graph. Mathematically speaking: 

P uv >pVu,veV. (43) 

Using the above equation we have: 

J2 P uv>N 2 - Y,P = N 2 -p (44) 

Using the above equation and Lemma 5, Eq. 15 can be bounded as: 

N — 1 f Nl 1 \ 



I - 



< ^L-1(NP + N) = ^(N-1)(1 + P). 



2 

Therefore, in this case time is of order O(N). 
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Corollary 4: In a sparsely connected wireless network the average stoping time is of order 
0(N 2 ) when network coding is applied. 

In this case each node only receives packets from at most two other nodes. Let assume p > 
be the smallest reception probabilities for all the nodes inside the network. Note that p can not 
be zero because we are assuming the network is connected which means every node receives 
data with non-zero probability from at leat one of its neighbors. In other words: 

MueV 3v EV s.t. P vu > p > 0, P zu = VzeV - {v} (46) 

By above formula we have: 

J2 p uv > Np (47) 

u,v£V 

By substituting above inequality and the result of Lemma 5 in Eq. 15 we have: 

N-l f N ~ l 1 \ 

E|T] ^ XT^^IS (1-^(71)^(7-))) +iV ) (48 > 

< ^^(N(5 + N) = -(N-l)N{l + f3). 

2 V 



Therefore, in this case time is of order 0(N 2 ). 
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